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CHAPTER  1 


INTRODUCTION 

1.1  Introduction 

In  today's  construction  industry  where  approximately  15%  of  all 
contractors  are  failing  (4:59),  and  where  litigation  and  arbitration 
are  increasingly  used  as  avenues  of  restoration  (3:391),  new  management 
tools  and  methods  are  required  to  increase  survivability.  Analytical 
procedures  can  be  used  to  improve  planning  of  a  forthcoming 
construction  job.  Trouble-shooting  during  the  course  of  construction 
can  be  accomplished  using  analytical  procedures.  Additionally,  if 
claims  cannot  be  avoided  on  a  job,  analytical  procedures  could  be  used 
to  compare  what  actually  happened,  to  what  should  have  happened. 

However,  analytical  procedures  tailored  to  the  needs  of  the 
construction  manager  are  scarce.  It  is  therefore  the  objective  of  this 
thesis  to  tailor  one  analytical  method  for  use  by  the  construction 
industry.  The  technique  chosen  for  research  is  the  modeling  and 
interpretation  of  construction  data  using  regression  analysis. 

Linear  regression  is  a  tool  which  has  been  available  to  the 
construction  industry  for  a  long  time,  yet  has  not  been  utilized  for  a 
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variety  of  reasons.  Based  on  this  writer’s  experience  and  the  courses 
taken  during  masters  studies  at  The  Ohio  State  University,  four  areas 
of  confusion  exist: 

1)  Data  acquisition:  The  data  necessary  for  linear 
regression  is  not  usually  collected.  Chapter  2  of  this 
thesis  will  present  some  of  the  reasons  and  possible 
cures. 

2)  Model  formulation:  A  methodology  for  the  formulation 
of  possible  models  is  lacking  in  the  construction 
industry.  A  proposed  methodology  is  the  subject  of 
Chapter  3. 


3)  Model  testing:  Several  techniques  are  available  for 
the  development  of  the  "best"  model.  A  few  of  these 
will  be  reviewed  and  utilized  in  Chapter  4. 


4)  Interpretation 

of 

results:  The  utility 

of 

the 

exercise  depends 

on 

the  interpretation 

of 

the 

regression  results 

• 

Chapter  5  will  present 

the 

interpretation  of 

the 

results  derived  from 

the 

data 

analyzed. 

1.2  Two  Types  of  Response  Variables 

Construction  studies  generally  involve  labor  and  material 
quantities.  Two  circumstances  exist  in  such  studies: 

1)  The  quantity  of  work  in  a  given  period  is  fixed  and 
the  labor  required  (input)  is  a  random  variable;  and 

2)  The  labor  available  to  do  the  work  is  fixed  and  the 
quantity  (output)  is  a  random  variable. 


3 

In  this  study,  the  data  was  obtained  from  a  project  in  which  the  first 
condition  held.  In  particular,  this  study  will  concern  concrete  work 

I 

in  which  the  quantitiy  to  be  placed  varied  from  day  to  day.  The 
quantity  for  each  day  was  known  and  fixed  prior  to  starting  the  day's 
work.  The  question  to  be  examined  may  be  stated  as  "How  many  men 
(manhours)  are  required  to  complete  a  given  day's  pour?  Hence  the 
dependent  variable  is  manhours. 

1.3  3oals  of  Study 

The  goals  of  this  study  are  threefold: 

1)  To  present  a  methodology  or  approach  to  developing 
a  regression  model; 

2)  To  show  how  one  may  predict  and  call  important 
information  from  a  final  model  in  order  to  make 
decisions  on  process  selection;  and 

3)  To  show  how  one  may  select  a  crew  size  which  is 
sufficient  to  complete  the  work  without  excess  idle 
time  using  said  model  and  associated  descriptive 


statistics 


CHAPTER  2 


DATA  COLLECTION 

2.1  Introduction 

Initial  analysis  is  generally  done  on  whatever  data  is  available. 
Upon  completion  of  this  analysis,  the  analyst  might  suggest  collection 
of  additional  types  of  data.  This  may  require  changes  in  collection 
techniques  such  as  additional  field  forms  or  proper  completion  of 
present  forms.  In  order  to  effectively  use  regression  analysis,  a  data 
set  including  all  possibly  relevant  variables  is  required.  An 
incomplete  data  set  will  yield  a  large  random  error  term  which  consists 
of  unexplained  error,  and  error  due  to  excluded  variables.  "The 
ordinary  least  squares  estimates  of  the  parameters  are  unbiased  only 
when  all  theoretically  specified  variables  are  included  in  the 
regression. "(8:82).  A  complete  data  set  will  result  in  a  more 
meaningful  model  to  the  analyst.  In  general,  the  construction  industry 
does  not  keep  the  right  records  from  which  a  complete  data  set  can  be 
extracted.  A  typical  construction  accounting  form  is  shown  in  Figure 
2.1.  This  form  is  used  by  The  United  States  Air  Force  to  account  for 
manhours  expended  on  in-house  construction. 

Through  examination  of  the  form  in  Figure  2.1,  it  is  noted  that 
items  of  interest  to  the  construction  manager  are  missing.  Other 
potentially  interesting  variables  include  crew  size,  temperature,  floor 
(elevation),  method  of  construction,  material  type,  area,  and  volume. 
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Figure  2.1  Air  Force  Labor  Accounting  Form  (10) 


An  iterative  process  consisting  of  collecting  data,  analyzing  the 
data,  then  collecting  more  data,  often  exists.  Variables  found 
important  in  previous  analysis  along  with  new  variables  to  be  tested, 
are  collected  in  subsequent  studies.  This  iteration  continues  until  it 
becomes  uneconomical  to  do  so,  or  until  the  analyst  chooses  to  halt. 
It  is  noted  that  it  is  always  less  expensive  to  collect  the  data  as  the 
process  is  going  on  rather  than  to  hire  a  consultant  to  piece  the 
information  together  at  a  later  date. 

2.2  Data  Used  in  This  Study 

A  complete  data  set  made  available  to  this  writer  will  be  analyzed 
and  reviewed  in  this  thesis.  The  data  was  extracted  from  work  account 
records  of  a  concrete  contractor's  performance  on  elevated  concrete 
slab  pours  of  a  45  story  building.  The  data  is  listed  in  Appendix  A. 
Labels  and  definitions  of  the  variables  are  given  in  Appendix  B.  These 
data  are  very  complete  and  provide  an  example  of  a  good  data  collection 
system  in  construction. 

In  order  to  avoid  confusion,  a  description  of  data  from  one 
workday  will  be  presented.  In  particular,  observation  one  in  Table  2.1 
will  be  explained. 

DATE  is  given  in  month/day/year  format.  For  this  work  day,  the 
date  is  November  22,  1983.  DAY  refers  to  the  day  of  the  contract.  Day 
one  of  the  contract  is  March  8,  1983.  The  elevation  above  ground  level 
(EL)  is  zero  feet  for  this  observation,  and  the  area  of  the  pour  (AREA) 
is  17,300  square  feet.  The  gross  volume  (V0LGR0SS)  of  concrete  poured 
is  345  cubic  yards.  The  minimum  temperature  (MINTEMP)  is  55  degrees 
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Fahrenheit  with  no  precipitation  (PRECIP).  The  method  of  placement  is 
truck  pumping  (TRUKPUMP  -  1,  PLACBUGY  =0).  The  slab  is  float  finished 
(FLOTFIN  *  1)  by  the  cement  finishers,  and  the  fixed  overhead 
(FIXEDOH) ,  which  consists  of  supervisor  costs  and  an  occasional  police 
officer,  is  $1,483.  The  pour  is  not  made  in  the  Kodiak  (K0D  =0)  area 
of  the  building,  nor  was  it  part  of  the  atrium  (ATR  *  0).  Workers  had 
to  assemble  75  feet  of  pipe  (ASSLEN)  to  facilitate  the  pumping 
operations.  This  particular  pour  required  100.5  manhours  of  concrete 
laborers  (LAB0RWH)  and  51  manhours  of  cement  finishers  (FINISWH). 

The  population  of  this  study  consists  of  all  elevated  concrete 
slab  pours  constructed  in  this  project.  Due  to  the  size  and  nature  of 
the  data,  all  the  data  was  utilized.  No  random  sampling  was  done. 
This  particular  project  ran  for  more  than  nine  months  and  involved  over 
$10  million  worth  of  concrete  work.  • 

After  some  preliminary  work  with  graphics  and  descriptive 
statistics,  a  change  in  modus  operandi  (5),  or  change  in  methods,  was 
identified.  Upon  inquiry  of  the  data  source,  it  was  discovered  that  a 


truck  pump  was  used  in  addition  to  the  stationary  pump  which  was  the 
standard  method  of  placement.  This  additional  information  was  added  to 


the  data  set.  Another  class  of  work,  slab  on  grade  (as  opposed  to 
elevated  slab)  was  included  in  the  original  data  set.  The  slab  on 
grade  pours  were  removed.  Outliers  were  identified  and  investigated, 
and  all  errors  in  data  entry  were  corrected. 

2.3  Summary 

Construction  data  historically  has  not  been  examined  by  linear 
regression.  Field  data  has  generally  been  limited  to  that  which  was 
required  by  accountants.  Reluctance  of  field  personnel  to  record  data 
and  failure  by  management  to  request  indepth  data  have  contributed  to 
the  problems  in  availability  of  data.  One  data  set  will  be  examined  in 
this  study  to  illustrate  the  methods  proposed  in  this  thesis. 


CHAPTER  3 


MODEL  FORMULATION 

3.1  Introduction 

The  purpose  of  this  chapter  is  to  introduce  a  standardized 

approach  fcr  the  formulation  of  a  model.  Any  approach  should  provide 
for  useful  interpretation  of  the  intercept  term  and  predictor 

variables.  This  project  includes  variables  at  the  ratio,  interval,  and 
binary  levels.  Each  type  of  variable  requires  a  different  type  of 
analysis. 

3.2  Three  Predictor  Variable  Groups 

Ratio-level  variables  have  an  absolute  minimum  value  of  zero. 

They  are  typically  real,  positive  variables  which  are  objective  in 

nature  such  as  area  and  volume. 

Interval-level  variables  have  arbitrarily  chosen  zero  points. 
These  variables  have  values  relative  to  an  arbitrary  point,  and 
include  variables  like  temperature  and  elevation. 

Binary-level  variables  have  values  of  either  zero  or  one.  These 
variables  have  a  value  of  0  for  an  observation  at  standard  condition 
such  as  "concrete  pumped"  (as  opposed  to  "direct  chute"  or  "crane  and 
bucket"),  and  a  value  of  1  when  conditions  are  other  than  standard. 

3.3  Standardization 

Often  the  construction  manager  is  interested  in  the  effect 
deviation  from  the  standard  condition  has  on  a  process.  This  is 
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some  standard  was  caused  by  an  external  force.  In  the  analysis  of  the 
data  presented  in  this  thesis,  deviation  from  the  average  condition  of 
the  interval-level  variables  will  be  examined.  The  standardization 
equation  which  will  be  used  is: 

XCS)  -  (X-X),  (3.1) 

where  X(S)  is  denoted  as  X-standardized; 

X  is  the  interval-level  variable;  and 

X  is  the  arithmetic  mean,  or  some  standard  value  of  X. 

By  definition,  an  interval-level  variable  has  an  arbitrary  zero  point, 
thus  standardization  will  change  the  intercept  and  not  the  slope. 

3.4  Draft  Model  Formulation 

Independent  variables  can  be  chosen  and  combined  into  a  draft 
model.  The  first  consideration  is  to  search  for  intercept 
combinations.  The  intercept  term  is  confounded  and  comprised  of 
mobilization,  demobilization,  and  a  contingency  buffer.  Interpretation 
of  the  intercept  term  is  further  complicated  because  ratio-level 
variables,  when  equal  to  zero,  are  in  most  cases  outside  the  data 
range.  The  intercept  term  is  a  defined  constant  when  all  ratio-level 
variables  are  zero,  i.e.,  no  concrete  has  yet  been  poured. 
Interval-level  variables  must  be  adjusted  when  standard  conditions  do 
not  exist.  This  is  most  easily  done  by  standardizing  the  variables; 
hence  all  interval-level  variables  will  be  standardized  using  Equation 
3.1. 

Adjustments  to  the  intercept  term  are  accomplished  by  adding  all 
binary  and  interval-level  variables  thought  to  affect  the  intercept. 
The  generic  formula  is: 

A-A0+ZBiA|+ZC| (X  -X), 


(3.2) 
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where  A  is  the  adjusted  intercept  term; 

A0  is  the  defined  constant; 

3|Ai  are  the  products  of  the  binary-level  variable 
and  its  coefficients;  and 

C;  (X  -X)  are  the  products  of  the  interval-level 
variable  and  its  coefficients. 

This  equation  can  be  transformed  into  the  form: 

A=A0[1+  bA  +  c.  (X  -X)].  (3.3) 

The  coefficients  of  Equation  3.3  are  not  the  same  as  the  coefficients 
in  Equation  3.2. 

Development  of  the  independent  variables  is  similar  to  the 
approach  used  in  developing  the  intercept  form.  Ratio-level  variables 
are  combined  with  binary  and  interval-level  variables  in  the  form: 

D-ZCVZJ  P(X  -X)]+  E(Zm).  (3.4) 

where  D  is  the  adjusted  coefficient; 

E(  is  the  coefficient  of  Zm; 

Zm  is  the  ratio-level  variable  being  developed; 

FnAn  are  the  binary-level  variables  affecting  Zm;  and 

Gp(X  -X)  are  the  interval-level  variables  affecting  7^. 
Transformation  is  accomplished  by  dividing  Equation  3.4  by  each  E,  Zm. 
It  should  be  emphasized  that  the  binary-level  and  interval-level 
variables  used  in  Equation  3.4  are  variables  which  are  considered  by 
the  analyst  to  affect  each  ratio-level  variable  chosen.  This  does  not 
preclude  the  use  of  other  variables  by  themselves. 

The  final  draft  model  combines  Equations  3.3  and  3.4  and  other 
chosen  variables  to  take  the  form  of: 
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Y=A0+Z3jAi+ZC1  (X  — X)+  ZmE|+Z(FnAn*Zm)+Z[3p(X  -X)*Zj. 


(3.5) 


This  equation  requires  transformation  in  order  to  ease  interpretation. 


The  transformed  equation  is: 

Y-A0[l+Zb.<V  +ZCj  (X  -X)]+E,  *2m[l+ZfnAn+Zgp(X  -X)] . 


(3.6) 


Summarizing  the  alternate  approach,  the  analyst  must  choose  from 
available  data  the  variables  which  may  be  helpful  in  explaining  or 
predicting  a  particular  construction  process.  All  interval-level 
variables  are  then  standardized  using  Equation  3.1.  Under  standard 
conditions,  all  binary-level  variables  are  set  at  zero.  The  intercept 
is  developed  using  interval  and  binary-level  variables  which 
intuitively  influence  the  mobilization,  demobilization,  and  contingency 
buffer.  Equation  3.2  is  used  to  develop  the  intercept  term.  After  the 
independent  variables  are  chosen,  they  are  combined  with  any 
influencing  variables  in  accordance  with  Equation  3.4.  The  draft  model 
is  then  transformed  to  facilitate  interpretation. 

3.5  Example  Model  Formulation 

Three  independent  variables  are  of  concern  to  the  construction 
manager  of  this  project.  The  laborer  workhours,  cement  finisher 
workhours,  and  fixed  overhead  models  will  be  developed  to  demonstrate 
the  concepts  introduced  in  this  chapter. 

Table  3.1  illustrates  the  variables  chosen  to  be  formulated  for 
the  prediction  of  LABORWH.  These  variables  were  chosen  from  this 
writer’s  experience  and  knowledge  of  concrete  operations.  DAY  is 
chosen  because  "learning  affect"  is  often  discussed  in  the  literature. 
The  elevation  may  be  significant  because  the  laborers  must  construct 
and  disassemble  the  pumping  pipe,  EL  and  ASSLEN  are  thus  included  in 
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Table  3.1  Chosen  Variables  for  the  Prediction  of  LA3CRWH 


VARIABLE 

TYPE 

CONTRIBUTES  TO 

LA3EL 

NEWDAY 

INTERVAL 

INTERCEPT 

STANDARDIZED  TIME 

ELSTAND 

INTERVAL 

INTERCEPT 

STANDARDIZED  ELEVATION 

TEMP 

INTERVAL 

INTERCEPT 

STANDARDIZED  TEMPERATURE 

PRECIP 

BINARY 

INTERCEPT 

PRECIPITATION 

KOQ 

BINARY 

INTERCEPT 

EODIAK  POUR 

ATR 

BINARY 

INTERCEPT 

ATRIUM 

PLAC3UGY 

3INARY 

INTERCEPT 

BUGGY  PLACEMENT 

TRUKPUMP 

BINARY 

INTERCEPT 

TRUCE  PUMP  USED 

VOLGROSS 

RATIO 

WORE  DONE 

GROSS  VOLUME 

VOLXOD 

PRODUCT 

WORE  DONE 

VOLGROSS  TIMES  EOD 

VOLATR 

PRODUCT 

WORE  DONE 

VOLGROSS  TIMES  ATR 

VOLDAY 

PRODUCT 

WORE  DONE 

VOLGROSS  TIMES  NEWDAY 

TVOLGROS 

PRODUCT 

WORE  DONE 

TEMP  TIMES  VOLGROSS 

PVOLGROS 

PRODUCT 

WORE  DONE 

PRECIP  TIMES  VOLGROSS 

V0L3UGY 

PRODUCT 

WORE  DONE 

PLAC3UGY  TIMES  VOLGROSS 

VOLTRUCX 

PRODUCT 

WORE  DONE 

TRUEPUMP  TIMES  VOLGROSS 

ASSLEN 

RATIO 

WORE  DONE 

PIPE  ASSEMBLY  LENGTH 

ELA3SLEN 

PRODUCT 

WORE  DONE 

ASSLEN  TIMES  ELSTAND 

TASSLEN 

PRODUCT 

WORE  DONE 

ASSLEN  TIMES  TEMP 

PA3SLEN 

PRODUCT 

WORE  DONE 

ASSLEN  TIMES  PRECIP 

AREAVOL 

PRODUCT 

WORE  DONE 

AREA  TIMES  VOLGROSS 

■  k  *  »'»*>’  kN  »** 


transformations  to  form  the  complete  transformation.  The  total 
transformation  form  is  accomplished  by  adding  the  intercept  and 
ratio-level  transformations  to  other  predictor  (AREAVOL  in  this  case) 
variables  which  may  effect  the  response  variable.  The  complete  model 
including  the  transformation  form  is  included  in  Appendix  D.  The 
response  variables  FINISWH  and  FIXEDOH  are  developed  and  transformed  in 
similar  fashion.  The  complete  models  are  included  in  Appendices  E  and 
F  respectively. 

3 . 5  Summary 

The  recommended  approach  to  formulating  regression  models 
accomplishes  two  things:  1.  It  proposes  a  standardized  method  of  model 
formulation;  and  2.  The  proposed  formulation  method  eases  the 
interpretation  of  the  regression  results.  The  predictor  variables  can 
be  broken  into  three  groups  necessary  to  accomplish  the  proposed  model 
formulations.  The  independent  variable  LABORWH  was  developed  as  an 
example.  Similar  procedures  were  used,  though  not  illustrated,  for  the 
development  of  FINISWH,  and  FIXEDOH. 
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CHAPTER  4 
MODEL  TESTING 

4.1  Introduction 

Cnee  the  models  have  been  formulated,  the  dead'.vood  or  useless 
variables  should  be  removed.  The  goal  of  regression  analysis  is  to 
build  a  simple  yet  meaningful  model.  Model  testing  is  used  to 

accomplish  this  goal.  Some  model  testing  methods  and  staristics  which 

2  2 

are  in  common  use:  stepwise  regression;  t-test;  E-test;  R  ;  ?.AOj  5 
Mallow's  Cp  statistic;  collinearity  diagnostics;  and  influence 
diagnostics.  Important  information  can  also  be  extracted  from  residual 
plots.  Ludolph  (6:31)  discusses  the  use  of  residual  plots  in  his  MS 
thesis.  All  of  the  above  tests  and  statistics  are  indicators  of  lack 
of  fit  of  a  model.  It  is  important  to  note  that  lack  of  fit  is  a 
relative  concept.  Standard  software  packages  exist  which  will  perform 
the  tests  necessary  to  run  a  regression  analysis.  SAS  (1)  will  be  used 
throughout  this  study. 

4.2  Traditional  Approach 

Traditional  hypothesis  testing  procedures  proposed  by 
statisticians  involve  selecting  variables  and  their  forms  (exponential, 
logrithmic,  etc.)  through  experience,  or  upon  suggestion.  This 
entails:  1.  Choosing  independent  and  dependent  variables;  2.  Selecting 
"in-place"  values  (observations  with  values  of  x  and  y)  versus 


selecting  random  assignment  (find  a  value  of  y  from  an  assigned  x 
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value);  3.  Choosing  a  number  of  observations  to  use;  and  4. 
Formulating  a  hypothesis.  Parameters  are  then  estimated  and  the 
hypothesis  is  tested. 

A  hypothesis  formulated  before  analysis,  based  on  theoretical 
ideas  is  refered  to  as  an  a  priori  hypothesis.  After  testing  an  a 
priori  hypothesis  through  regression,  the  regression  results  can  be 
used  to  formulate  a  new  hypothesis,  or  an  a  posteriori  hypothesis  (9). 
A  new  data  set  is  required  to  test  the  a  posteriori.  This  procedure  is 
repeated  until  the  "best"  model  is  formed.  There  may  not  be  a  best 
model.  The  traditional  approach  requires  much  data  to  satisfy  the 
assumptions  of  regression. 

A  hypothesis  formed  by  the  traditional  approach  on  this  data  set 
is  illustrated  in  Appendix  C  of  this  thesis.  The  dependent  variable 
being  tested  is  LABOR’J/K. 

4.3  Testing  LA30RWH 

The  computer  output  for  the  analysis  of  LABORWH  is  attached  in 
Appendix  D.  The  thought  process  used  in  developing  the  final  model 
will  be  discussed  in  this  section;  a  tableau  of  test  results  is 
provided  in  Table  4.1.  The  subsequent  response  variables  will  be 
briefly  presented  with  inclusion  of  a  tableau  similar  to  Table  4.1. 

Due  to  the  large  number  of  variables  formulated  in  the  draft  model 
for  predicting  labor  workhours,  the  data  was  screened  using  stepwise 
regression.  An  a  level  of  0.25  was  chosen.  Thirteen  variables  met  the 
0.25  significance  level  and  were  input  into  a  regression  run.  The 
initial  regression  run  yielded  a  R^Qj  of  0.6732,  and  a  mean  square 
error  of  148.374.  The  predictor  variable  PAS3LEN  is  the  least 
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A.  A*  A  A  _  A-  4*  f  ■sJ"  »  *  "V"  of* 


REGRESSION 

RUN 

VARIABLE 

REMOVED 

MSE 

R2 

1 

- 

148.374 

0.7049 

0.6732 

2 

PASSLEN 

149.443 

0. 7013 

0.6719 

3 

AREAVOL 

149.927 

0.6979 

0.6703 

*4 

ELSTAND 

150.293 

0.6947 

0.6700 

5 

KOD 

155.003 

0.6325 

0.6597 

*  INDICATES  "BEST"  MODEL 
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significant  in  the  t-test,  and  is  thus  removed  in  the  second  regression 
run.  Run  two  shows  a  slight  drop  in  the  R^0J  and  a  small  rise  in  the 
MSE.  By  the  t-test  criteria,  AREAVOL  is  removed  and  the  regression  is 
run  again.  The  RAoj  and  MSE  again  experiences  a  slight  drop  and  rise 
respectively.  These  changes  have  been  slight  thus  far  which  indicates 
an  insignificance  of  the  removed  variables.  ELSTAND  is  removed  and  a 
fourth  regression  is  run.  The  results  of  this  run  are  similar  to 
previous  runs.  The  fifth  run  with  KOD  removed  results  in  significant 
changes  in  the  indicators  used  to  this  point;  the  model  chosen  is  thus 
that  tested  in  regression  run  four. 

The  collinearity  diagnostics  indicate  the  degree  of 
multicollinearity  present  in  the  model.  Table  4.2  illustrates  the 
results  of  collinearity  diagnostics  on  the  final  model  for  the 
prediction  of  LABORWH.  Of  particular  interest  is  the  conditional 
index.  When  the  conditional  index  is  large,  estimates  of  the  response 
variable  may  have  considerable  amounts  of  numerical  error.  It  has  been 
suggested  that  a  bottom  line  conditional  index  less  than  30  indicates 
an  acceptable  model  (9).  The  conditional  index  of  16.432  in  Table  4.2 
indicates  the  final  LABORWH  model  is  fair. 

The  influence  diagnostics  provide  a  measure  of  the  influence  each 
observation  has  on  the  parameter  estimates  of  the  model.  Appendix  D 
contains  the  influence  diagnostics  for  LABORWH.  The  important 
diagnostics  are  provided  by  examining  the  DFBETAS.  This  statistic 
indicates  the  effect  the  removal  of  a  particular  observation  has  on 
each  of  the  parameter  estimates,  scaled  by  the  standard  error.  A  value 
for  a  DFBETA  of  0.5  or  more  is  cause  for  some  concern. 


Table  4.2  Collinearity  Diagnostics  of  LABORWH  Model 


COLLXNEABIIT  DIAGNOSTICS  VARIANCE  PROPORTIONS 


condition 

PORTION 

PORTION 

PORTION 

PORTION 

RUBBER 

EIGENVALUE 

INDEX 

INTERCEP 

TEBP 

ROD 

VOLGROSS 

1 

4.600 

1.000 

0.0006 

0.0031 

0.0005 

0.0009 

2 

2.073 

1.265 

0.0042 

0.0032 

0.001 3 

0.0050 

3 

1  .002 

1  .598 

0.0003 

0.0000 

0.0309 

0.0020 

tt 

0.942679 

2.209 

0.0000 

0.0096 

0.0008 

0.0010 

S 

0.272347 

4.110 

0.0193 

0.0773 

0.0000 

0.0056 

6 

0.217170 

4.602 

0.0007 

0.0119 

0.0034 

0 .0047 

7 

0.102940 

6.664 

0.0391 

0.0202 

0.2113 

0.1268 

6 

0.072460 

7.967 

0.0570 

0.5359 

0.0107 

0.0037 

9 

0.057462 

8.947 

0.0183 

0.1726 

0.6422 

0.0464 

10 

0.043434 

10.291 

0.2071 

0.01 26 

0.0720 

0.6217 

11 

0.017034 

16.432 

0.6454 

0.1534 

0.0266 

0.1 622 

PORTION 

PORTION 

PORTION 

PORTION 

PORTION 

PORTION 

PORTION 

NUHBER 

VOLKOO 

VOLDAX 

TVOLGROS 

VQLTKUCR 

ASSLEN 

ELASSLEN 

TASSLEN 

1 

0.0005 

0.0063 

0.0027 

0.0000 

0.0007 

0.0039 

0.0030 

2 

0.0014 

0.0070 

0.C025 

0.0124 

0.0012 

0.0000 

0.0330 

3 

0.03S3 

O.UOOI 

0.0002 

0.0052 

0.0001 

0.0004 

0.0001 

4 

0.0006 

0.0149 

0.0077 

0.1875 

0.0023 

0.C0S6 

0.0030 

5 

0.0020 

0.0005 

0.0055 

0.1156 

0.0015 

0.2105 

0.0031 

6 

0.0031 

0.7804 

0.0079 

0.1501 

0.0006 

0.0232 

0 . PI  20 

7 

0.3635 

0.0005 

0.1700 

0.0006 

0.0017 

0.0020 

0.0644 

6 

0.0130 

0.0034 

0.0609 

0.1336 

0.0143 

0.0906 

0.4026 

9 

0.5536 

0.0355 

0.3953 

0.0026 

0.0000 

0.0059 

0.0992 

1C 

0.0216 

0.1506 

0.3472 

0.0005 

0.0317 

0.0901 

0.2503 

11 

0.0054 

0.0006 

0.0002 

0.3920 

0.9459 

0.5776 

0.1599 
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Several  steps  may  be  taken  when  a  point  with  large  values  is 
encountered.  The  first  step  in  dealing  with  these  data  points  is  to 
check  data  entry  for  accuracy.  Next,  the  observations  should  be 
examined  to  ensure  that  they  are  part  of  the  population  of  the  study. 
Observation  25,  97,  and  102  were  scrutinized  in  this  manner  due  to 
large  values  of  "DF3ETAS".  No  data  entry  error  was  noted,  and  the 
observations  belong  to  the  population  outlined  in  Section  2.2.  There 
is  a  possibility  that  the  data  was  recorded  incorrectly  at  the  job 
sight.  The  fate  of  highly  influential  observations  lies  in  the  hands 
of  the  analyst:  they  should  not  be  dropped  without  much  thought  and 
good  reason.  In  the  present  case,  the  observations  were  retained. 

The  final  test  of  the  model  chosen  as  the  "best"  model  is  the 
Extra  Sums  of  Squares  Test.  An  alpha  level  of  0.05  is  chosen  for  this 
test.  The  full  model  consists  of  the  variables  singled  out  in  the 
final  stepwise  regression  step.  The  reduced  model  consists  of  the 
variables  present  in  the  "best"  model.  The  removed  variables  are  being 
tested  for  significance.  The  removed  variables  are  PASSLEN,  K.0 D, 
AREAVOL.  With  ^  -13-10  or  three,  and  v2  =135-13-1  which  equals  121,  the 

F.-2.63.  The  observed  F,  F  is: 

C  *  o 

F=(43021. 12-42398. S3)*121/[ 13013. 3*(13-10)]-1. 39.  Because  F  >F 
u  CO' 

none  of  the  removed  variables  contribute  anything  above  and  beyond 
those  in  the  "best"  model.  Thus  the  ten  variables  tested  in  regression 
run  three  make  up  the  final  or  "best"  model. 

4.4  Testing  FINI5WH 

The  variables  thought  to  have  an  impact  on  the  cement  finishers 
workhours  are  ELSTAND,  TEMP,  KOD,  V0L3R0SS,  V0LK0D,  V0LDAY,  TV0L3R0S , 
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VOLTRUCX,  ASSLEN,  ELASSLEN,  TASSLEN,  PASSLEN,  and  AREAVOL.  Due  to  the 
relatively  low  nunber  of  variables,  a  stepwise  regression  is  not  run. 
Applying  the  various  lack  of  fit  tests  results  in  the  sixth  regression 
run  on  Table  4.3  being  chosen  as  the  "best"  model.  Removal  of  AREAVOL 
may  also  be  considered.  The  small  changes  in  the  MSE,  R2 ,  and  adjusted 
R2  indicate  that  the  variables  removed  are  insignificant.  An  overall 

' 

|  of  0.7826  is  very  good  for  this  type  of  data.  Over  78&  of  the 

variability  in  FINISWH  is  explained  by  these  five  variables. 

|  4J5  TESTING  FIXEDOH 

i 

j  Stepwise  regression  resulted  in  13  variables  meeting  the 

;  significance  level  of  entry  of  0.25.  The  model  with  NEWDAY  and  FINISWH 

’  removed  had  an  R2  of  0.6614.  The  removal  of  PASSLEN  resulted  in  a 

ADJ 

» 

■  drop  of  over  one-tenth  in  the  adjusted  R2 .  PASSLEN  is  thus  left  in  the 

model,  and  no  further  regressions  are  run.  Eleven  variables  remain  in 
the  model.  I't  is  desireable  to  have  less  than  eleven  variables  in  the 
final  model,  but  significant  changes  occur  in  the  three  tests  in  Table 
4.4  with  the  removal  of  the  next  viable  variables,  PASSLEN  then  ECD. 
The  regression  runs  are  included  in  Appendix  F  of  this  thesis. 

4. 6  Summary 

The  model  testing  tools  reviewed  in  this  chapter  provide 

indicators  to  the  lack  of  fit  of  the  model  being  tested;  however  no  one 

test  is  conclusive  in  itself.  3y  combined  use  of  the  tests  a  "best" 
model  can  be  chosen.  This  may  not  be  the  best  model,  indeed  there  may 
not  be  a  best  model  (2:85).  If  blessed  with  an  abundant  amount  of 

data,  the  analyst  is  urged  to  use  accepted  means  of  hypothesis  testing. 


This  is  rarely  the 


case 


in  the  construction 


industry. 


I 
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Table  4.3  Summary  of  Tests  on  FINISWK 


REGRESSION 

VARIABLE 

MSE 

R2 

R2 

ADJ 

RUN 

REMOVED 

1 

- 

125.290 

0.8063 

0.7907 

a 

PRECI? 

124.308 

0.3063 

0.7924 

3 

TEMP AREA 

123.343 

0.8052 

0.7940 

4 

PRECAREA 

123.242 

0.3049 

0.7941 

5 

FLOTAREA 

124.954 

0 • 8005 

0.7913 

*6 

NEWDAY 

127.364 

0.7952 

0.7873 

7 

AREAVOL 

130.127 

0.7891 

0.7326 

Table  4.4 

Summary  of 

Tests  of 

FIXEDOH 

REGRESSION 

VARIABLE 

MSE 

R2 

R2 

ADJ 

RUN 

REMOVED 

1 

- 

15586.113 

0.6980 

0.6556 

2 

NEWDAY 

15664.113 

0.3940 

0.6639 

*3 

FINISWH 

15781.347 

0.S392 

0.6514 

4 

PASSLEN 

16423.050 

0.6739 

0.6476 

*  INDICATES  "BEST"  MODEL 
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CHAPTER  5 


INTERPRETATION  OF  RESULTS 

5.1  Introduction 

In  a  simple,  first-order  model  (no  higher  order  or  interaction 
terms),  a  one  unit  change  in  an  independent  variable  with  coefficient  3 
will  result  in  a  mean  change  in  the  dependent  variable  by  B,  all  other 
variables  held  constant.  Productivity  is  defined  as  output/ input 
(quantity/manhours).  When  the  dependent  variable  is  manhours,  the 
coefficient  of  independent  variables  with  units  of  quantity  of  work 
(cubic  yards  poured,  square  feet  painted,  etc.)  Is  the  inverse  of 
productivity.  It  Is  suggests  these  be  labeled  "unit  time"  when 
manhours  is  the  response  variable,  and  "unit  cost"  when  cost  is  the 
response  variable  (5). 

Multicollinearity  can  cause  very  confusing  results.  If 
multicollinearity  exists  between  two  independent  variables,  each 
variable  may  seem  insignificant  based  on  individual  t-tests,  but  when 
the  F-test  is  used  at  least  one  of  the  two  variables  may  be 
significant. 

When  an  interaction  is  present  in  the  model,  keeping  one  variable 
constant,  both  the  slope  and  the  intercept  change  as  the  second 
variable  in  the  interaction  changes.  The  slope  of  a  variable  depends 
on  the  other  interaction  variable.  For  example,  given  the  model  Y=A 
+B*X+C*Z+D*X*Z,  the  slope  of  Z  equals  C  +D*X  at  a  given  X  (7:199). 


The  coefficients  of  the  transformed  models  represent  rates  of 
change.  By  transforming  the  model  equations,  it  is  possible  to 
identify  the  effects  certain  variables  have  on  the  intercept  and 
ratio-level  variables.  The  intercept  cannot  be  broken  down  into  its 
three  components  (mobilization,  demobilization,  and  contingency  buffer) 
using  the  techniques  proposed. 

An  interpretation  of  the  results  of  the  LA30RWH  will  be  discussed 
in  this  chapter.  Then  the  goals  of  the  study  as  addressed  in  Section 
1.3  will  be  completed.  The  chapter  will  close  with  some  applications 
of  the  results. 

5.2  Interpretation  of  LABORWH 

The  "best"  model  as  identified  in  Chapter  4,  can  be  transformed  to 
ease  interpretation.  The  intercept  term  has  the  transformed  form  of 
28.23*(1  -0.02*TEMP  -0.96*K0D).  Both  ratio-leve1  variables  are  also 
transformed.  VOLGROSS  takes  the  form  0.222*VOLGROSS*(1  +2.64*K0D 
+0.0038*DAY  -0. 018*TEMP  +0.74*TRUKFUMP).  Likewise,  ASSLEN  is 
transformed  as  0.061*ASSLEN*(1  -0.004*EL  +0.039*TEMP ) . 

The  coefficients  of  the  "best"  model  for  the  prediction  of 
LABORWH  identify  interesting  implications  of  the  pouring  operations. 
The  intercept  term  indicates  over  28  hours  of  work  are  required  when  no 
work  is  done  and  operations  are  at  standard  conditions.  A  proportional 
increase  to  the  intercept  is  caused  by  temperatures  below  the  average. 
This  indicates  more  manhours  are  required  for  mobilization, 
demobilization  and  contingencies  when  the  temperature  is  low.  Kodiak 
pours  cause  a  proportional  decrease  in  the  intercept.  This  is  most 


likely  due  to  the  fact  that  Kodiak  pours  are  smaller  in  volume  than  the 


norm.  They  consist  of  areas  which  were  blocked  out  (not  poured  when 
rest  of  slab  was  pou  ed)  to  accomodate  the  tiebacks  which  supported  a 
crane  system. 

The  Kodiak  pours  cause  a  proportional  increase  in  the  overall 
manhours  of  the  pour.  The  Kodiak  pours  are  small,  but  somewhat 
specialized  due  to  their  location.  The  day  of  work  and  fact  that  truck 
pumps  are  being  used  have  the  same  effect.  Work  tends  to  slow  down  as 
the  job  nears  conclusion.  Correcting  punchlist  items  combined  with  the 
fact  that  the  workers  will  be  out  of  work  when  the  job  is  done  may 
explain  this.  Since  concrete  sets  faster  in  warmer  weather,  it  is 
expected  that  manhours  and  temperature  have  a  negative  relation,  which 
is  reflected  in  the  regression  results.  The  proportional  effects  of 
elevation  and  temperature  on  the  assembly  length  are  confusing.  At 
lower  elevations  it  tends  to  require  more  work  to  assemble  the  pipe. 
Equally  confusing  is  the  fact  that  at  higher  temperatures  the  manhours 
used  to  assemble  the  pipe  increase. 

Based  on  the  results  of  the  regression  runs,  managers  have  more 
insight  into  the  construction  process.  Providing  protection  from  the 
environment  is  something  which  should  be  examined.  The  cost  of  the 
protection  may  outweigh  the  savings  in  labor  costs.  Avoiding  the  use 
of  the  truckpump  is  another  item  which  could  save  money.  Other  items 
such  as  the  effects  of  time  and  Kodiak  pours  are  important  for  planning 
purposes.  Schedules  can  be  made  reflecting  these  effects. 

Piets  of  the  regression  residuals  versus  each  of  the  independent 
variables  of  the  final  model  for  LABORWH  appear  in  Appendix  D.  The 
plots  appear  to  be  homoscedastic.  There  are  no  strong  trends.  The 
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Table  5.1  Descriptive  Statistics  of  LABCRWH  Model 


VARIABLE=RESID 


RESIDUALS 


MOMENTS 


M  135 
HEAR  -1.179E-14 
SID  DEV  11.7931 
SKEMNESS  0.546949 
□5S  18636.3 
CV  -1 .  OOOE  +  1 7 
r:3EAN=0  -1.162E-14 
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NON  -=  0 


-280 

135 
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135 
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1 
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-24.2865 
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plot  of  the  residuals  versus  V0L3R0S3  hints  that  less  variability  in 
manhours  appears  with  larger  pours  (greater  than  200  cubic  yards). 
Descriptive  statistics  of  the  residuals  make  it  possible  to  choose  a 
crew  size  for  average  conditions.  By  replacing  the  variable  names  with 
the  desired  (proper)  numerical  values,  the  mean  labor  workhours 
required  can  be  predicted.  Being  the  mean,  roughly  50%  (assuming  a 
nearly  symmetrical  distribution)  of  the  time  more  labor  than  is 
predicted  will  be  required.  Table  5.1  contains  the  descriptive 
statistics  for  the  residuals  of  the  "best"  model  for  prediction  of 
LASORWH.  The  descriptive  statistics  indicate  16.56  manhours  should  be 
added  to  the  mean  prediction  in  order  to  reach  a  90%  confidence  level 
of  completing  the  job  in  the  predicted  time.  The  job  of  the  manager  is 
to  choose  the  confidence  level.  The  additional  manhours  required  to  be 
added  to  the  mean  can  then  be  derived.  The  crew  size  can  be  chosen 
using  the  simple  analysis  described  above. 

5.3  Summary 

There  are  two  major  concerns  of  interpretation  of  the  results. 
The  first  is  one  of  validity.  Are  the  results  useful?  If  not,  all  is 
not  lost.  Important  insights  into  the  operations  can  be  gained  through 
analysis.  The  deficiency  of  a  single,  precise  lack  of  fit  test  is  the 
second  concern. 

Having  found  the  results  valid,  the  construction  manager  has  many 
uses  for  them.  Manhours  can  be  predicted,  decisions  on  methods  of 
construction  can  be  made,  and  crew  size  can  be  chosen.  By  multiplying 
the  manhours  for  both  laborers  and  finishers  by  their  respective  rates 
of  pay,  and  adding  the  cost  of  the  fixed  overhead,  a  cost  estimate  can 


be  derived.  Regression  analysis  can  also  be  used  in  claims  cases. 
Costa  can  be  calculated  for  the  way  a  project  should  have  been.  This 
can  be  compared  to  the  way  it  actually  was.  The  claim  can  then  be 
based  on  this  difference. 


CHAPTER  6 


SUMMARY  AND  CONCLUSIONS 

5. 1  Summary 

In  Chapter  1  of  this  thesis,  four  areas  of  confusion  were 
discussed,  and  the  three  major  goals  of  the  study  were  given.  The  data 
set  for  this  research  was  described  in  Chapter  2.  Chapter  3  dealt  with 
the  formula-ion  of  regression  models.  The  models  were  tested  in 
Chapter  4,  and  interpreted  in  Chapter  5.  The  major  conclusions  of  this 
study  and  recommendations  for  future  research  are  given  hereafter. 

6.2  Major  Conclusions  of  Confusion  Areas 

The  major  conclusions  based  on  the  four  confusion  areas  presented 
in  Section  1.1  of  this  thesis  are: 

1)  Data  aquisition  in  the  construction  industry  needs 
improvement  in  order  to  facilitate  various  analytical 
techniques.  Construction  managers  and  accounting 
personnel  need  to  develop  data  forms  together  to 
satisfy  both  of  their  needs.  It  is  less  expensive,  and 
more  accurate,  to  collect  the  right  data  as  the  job  is 
in  progress  than  to  try  to  extract  it  from  records 
after  the  job  is  complete. 

2)  The  thrust  of  this  study  is  in  the  methodology 
proposed.  A  nine  step  approach  to  the  analysis 
of  construction  data  has  been  introduced:  1.  Data 


collection;  2.  Choose  response  variables;  3.  Break 
independent  variables  into  ratio,  interval,  and 
binary-levels;  4.  Develop  initial  regression  model(s); 
5.  Model  coding;  6.  Identify  independent  variables  in 
coded  form  and  calculate  transforms;  7.  Run  regression, 
find  parameters,  and  select  best  model;  8.  Decode  best 
model;  and  9.  Interpret  decoded  model.  This  nine  step 
approach  provides  a  standard  methodology  in  lieu  of  a 
capricious  approach.  The  regression  results  achieved 
by  this  method  on  the  data  set  used  in  this  study  are 
within  the  region  of  results  one  would  expect  to  find 
in  the  industry. 

3)  Many  of  the  assumptions  of  regression  are  violated 
in  these,  and  most  construction  data.  Because 
construction  data  are  few  and  far  between,  random 
sampling  is  often  not  done.  This  is  the  case  in  this 
study.  Additionally,  the  random  error  associated  with 
any  one  dependent  variable  is  not  always  independent  of 
the  error  associated  with  any  other  point.  The 
question  to  be  resolved  by  the  analyst  is  one  of 
validity;  "Is  regression  analysis  useful  for  a 
particular  construction  process?"  The  violations  of 
assumptions  tend  not  to  loom  as  big  if  the  methodology 
is  valid  (useful). 

4)  The  value  of  regression  analysis  lies  in  the 
interpretation  of  the  results.  Improvements  to  a 


process,  and/or  identifications  of  problem  areas  in  the 
process  can  be  achieved  through  proper  interpretation 
of  the  regression  results.  This  interpretation  can  aid 
in  decision  making.  Descriptive  statistics  of  the 
residuals  of  a  model  provide  for  such  things  as  the 
selection  of  crew  size. 

6.3  Satisfaction  of  Goals  of  This  Study 


The  three  goals  of  this  study  presented  in  Section  1.3  have  been 
satisfied: 

1)  A  Methodology  was  developed  in  this  thesis  and  is 
summarized  by  the  nine  step  approach  presented  in  the 
second  section  of  this  chapter; 

2)  Examples  of  how  one  can  use  the  parameter  estimates 
to  make  such  decisions  as  whether  to  use  weather 
protection  or  not  is  presented  in  Section  5.2;  and 

3)  A  demonstration  of  how  a  manager  can  use  the 
descriptive  statistics  of  the  model  residuals  to  choose 
a  crew  size,  based  on  a  percentile  dictated  by 
management,  is  included  in  Section  5.2  of  this  thesis. 

6.4  Recommendations  for  Future  Research 

Four  areas  offer  potential  for  future  research.  The  first  is  to 
continue  applying  the  techniques  discussed  in  this  study  to  to  other 
data  sets.  An  opportunity  to  analize  data,  then  use  the  results  to 
predict  an  actual  job,  would  provide  for  an  interesting  study. 
Analysis  of  data  from  various  contractors  with  similar  specialties  may 
identify  aspects  common  to  all  the  data.  Modification  of  the 
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methodology  may  improve  the  results  and  validity  of  them. 

Deconfounding  the  intercept  term  provides  the  second  area  of 
future  research.  It  may  only  be  possible  to  accomplish  this  through 
time-lapse  study.  The  time-lapse  equipment  is  available  at  The  Ohio 
State  University. 

The  third  area  for  future  research  is  to  test  and  further  develop 
the  nine  step  approach  introduced  in  this  thesis. 

The  final  suggested  research  subject  is  to  evaluate  the  benefits 
and  desirability  of  using  a  confidence  interval  approach  to  sizing  of 
crews  given  daily  point  estimates  of  manhour  requirements. 
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APPENDIX  A 
DATA  USED  IN  STUDY 

This  section  contains  the  data  used  in  this  study.  Some  variables 
made  available  to  this  writer  are  not  included  in  the  following  list 
because  they  were  not  used  in  this  study.  Additionally,  no  transformed 
values  such  as  AREAVOL  are  listed  in  these  pages. 
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VARIABLE  LABLES 

This  section  labels  or  defines  the  variables  used  in  this  study. 
Due  to  field  length  limitations  in  SAS,  the  labels  are  short  and 


concise 
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Table  B.l  Variable  Labels 


LABEL  AREA=AREA  FINISHED  BT  CERENT  FINISHERS 
ATR=ATRIOH  VECTOR 
KOD=KODIAK  VECTOR 

P00RC0ST=C05T  OF  THE  POOR  IB  THOOSANOS 
DAI=DAT  OF  WORK  HITH  BARCH  8,1903  AS  DAT  1 
EL=ELE¥ATION  OF  POOR  ABOVE  GROUND 
¥0 LG ROSS -GROSS  ¥OLOBE  OF  CONCRETE  POOR 
HINTESP=THE  DAILI  HINIHOH  TEBPERATORE 
ASSLEN=ASSE3BLT  LENGTH  OF  PIPE 
POHPTROK=TRDCK  POHPVECTOR  TIRES  AREA 
TROKPOHP=TROCK  POHP  VECTOR 
PRECIPSPRECIPITATION  VECTOR 
PLACBOGI-BOGGY  VECTOR 
FLOTFIM=FLO AT  FINISH  VECTOR 

FIIEDOH=COST  OF  SUPERVISION .POLICE .OPERATORS 
L A BORN H=LA BORERS  MORE  HOURS 
FINISHB=CEHENT  FINISHERS  NORA  BOORS 
N  EUDATsSTANDARDIZED  DATS 
ELSTANDsSTABD ARDIZED  ELEVATION 
TEHP=STANDARDIZED  HINIHOH  TEBPERATORE 
VOLKOD- VOLGROSS  TIRES  KODIAK  VECTOR 
VOLCRAH-VOLGROSS  TIBES  CRANE  VECTOR 
VOLTRUCK=VOLG ROSS  TIBES  TROCKPOBP  VECTOR 
VO L AT R= VO LG ROSS  TIBES  THE  ATRIOH  VECTOR 
TV0LGR0S-V0LG3055  TIBES  THE  STANDARDIZED  TSBP 
PVOLGRQS= VOLGROSS  TIBES  THE  PRECIP  VECTOR 
VOLBOGI=VOLSROSS  TIBES  THE  BOGGY  VECTOR 
PASSLEN=ASSEBBLT  LENGTH  OF  PIPE  TIBES  PRECIP 
PRECARBASAREA  TIBES  THE  PRECIPITATION  VECTOR 
IEBPAREA-TEBP ERATO RE  TIBES  THE  AREA 
FLOTAREA=ABEA  TIBES  THE  FLOAT  VECTOB 
ELAS5LEN  =EL  TIBES  ASSLEN 
TASSLEN=TEHP  TIBES  ASSLEN 
AREAVOL=AREA  TIBES  VOLGROSS 
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TRADITIONAL  TEST  OF  LABORWH 

C. 1  Introduction 

A  traditional  approach  to  formulating  and  testing  the  LABORWH 
model  is  presented  in  this  section.  The  testing  of  the  model  is 
similar  to  the  testing  of  the  models  discussed  in  Chapter  4  of  this 
thesis;  the  formulation  of  the  models  is  much  different  than  the 
approach  proposed  in  this  study. 

C.2  Hypothesis 

It  Is  thought  that  MINTEMP,  VOLGROSS,  EL,  AREAVOL,  ASSLEN,  PRECIP, 
KOD,  ATR,  PLACBUGY,  AND  TRUXFUMP  contribute  something  to  the  prediction 
of  LABORWH.  The  null  hypothesis,  HQ  :Sq=31  =...BN  =0,  indicates  no 
variables  contribute  to  LABORWH.  The  alternate  hypothesis,  H„:3  *3 

*...3n*0,  indicates  at  least  one  B  is  not  equal  to  zero. 

C.3  Testing  the  Model 

The  first  regression  run  with  all  considered  variables  included 
has  an  R^QJ  *0.5354  and  MSE  of  188.836.  Removal  of  PRECIP  results  in  a 
slight  rise  in  the  R^0J  (0.5867)  and  a  slight  decrease  in  the  MSE 
(188.274).  The  RaQj  lowers  to  0.5829  and  the  MSE  rises  to  189.951  when 
PLACBUGY  is  removed.  These  adjustments  are  small  enough  to  be 

considered  insignificant  however.  After  removing  the  variable  ATR,  the 

2 

Raqj  decreases  a  bit  more,  and  MSE  shows  a  larger  increase.  Hence,  the 
model  with  ATR  included  is  considered  the  best. 
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The  last  test  to  be  used  in  qualifying  this  model  is  the  F-test  on 
the  excluded  variables.  An  alpha  value  of  0.05  results  in  an  ?  of 
3.07.  F0  =(  37619.231  -37099.312)  *124  divided  by  2*23415.504.  Hence, 
?Q  equals  1.37.  Since  Fc  >FQ  ,  it  is  95%  certain  that  none  of  the 
removed  variables  contribute  significantly  to  the  prediction  of 
LA30RWH.  The  final  model  is  a  good  model. 

C.4  Interpretation  of  Results 

The  mobilization/demobilization/contingency  buffer  (intercept)  is 
33.4  hours.  Labor  workhours  decreases  0.22  hours  for  every  degree 
increase  in  temperature.  Maintaining  a  constant  area  of  pour,  the 
manhours  required  increases  with  volume.  Elevation  remains  confusing. 
The  manhours  tends  to  decrease  as  the  elevation  increases.  Thinner 
slabs  at  the  higher  elevations  may  be  an  answer.  Kodiak  pours  tend  to 
require  15  more  hours  to  complete,  and  atrium  pours  nearly  eight  more. 
Using  a  truck  pump  increases  the  time  by  12  1/2  hours  on  the  average. 

C. 5  Summary 

Many  testing  techniques  are  covered  in  the  literature.  The 


testing  methods  used  in  the  body  of  this  study  was  used  in  this 
appendix.  The  "best"  model  should  be  tested  again  with  a  new  set  of 
data,  an  additional  variables  perhaps  added. 
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Table  C.2  Traditional  Approach  Regression  Run 


h  » 

A  O 

co  a 

O  O 

ae  • 

a.  o 


o*-ooa»oo:fr»<n 

owiooooocrtao 

OOOOQO»-Or-a 
•  ••••«•••• 
oooooooooo 


w 

»  r* 

3 

m 

3  O 

•4 

*-  CD 

«* 

• 

vo  in 

> 

(N 

•  • 

<N 

a  o 

W 

tn 

ae 

o 

S 

3  W 

r*  a 

n  a 

*•  as 

a  r* 

as  m 

w  < 

r*  cm 

•»  i 

as 

E  3 

•  • 

3  3 

O 

a 

vo  co 

a 

3 

in 

vo  cn 

in  n 

<n 

CBS 

r- 

i  a 

a* 

«  X 

M 

ae 

o 

s 

w  in 

m  fN  m 

o  m  *- 

o  w 

NOVf- 

O  111  N 

as 

r»  *—  o> 

IN  31  3 

E  < 

•  •  • 

«■*  p>*  n» 

0  3  9 
o  n  m 
Ut  if)  o 
A  (H 

m  «n  us 


rt  \o  • 

in 

•  •  *— 

<n  r» 

r—  03 


<rtf><40vaaof*»s»'aro\p» 
*->Otn<ECM(Ntf133knCN 
OOl  9M)«£NtO«oO 

i  t  i 


tno^mjcioci 

itor*>vorMQornm9 
MNBfMODOnB 
oooo30t-m 
•  •  m  •  •••  « 
oooovotnvocr 
o 


oiflmoniNOinfsm 

r*aniNcB(NBO«yO 

I’1<0300(NOWN£HB 

cn^-cMcnomocrOin 

o(Nr-Or-o»piounffl 

OIN3pOO^Sfll^ 

••••••••«# 

oooooooaoajpn 
mi  I  i  t-  t- 


an  tn  3 
<N  m 


J  «  |H 

WOO 
a  as  h 

o  S 
CHIU 


H  • 
o  a.  > 
o  w  • 

SOCJ 


w  tn  >»  a. 

U  4  in  »J  13  jc 

u  jc  a  o  as  oo 

w  w  oc  »  w  m  o, 

WHO  ■*  _J  U  C 

H  *  «4  W  in  Q  OB  •«  3 

ZNOUKinOH  JC 


Traditional  Approach  Regression  Run 


KOD  1  14.995406  5.818968  2.577  0.0111 
ATR  1  7.778109  5.041615  1.543  0.1254 
TRUKPUHP  1  12.482777  4.485767  2.783  0.0062 


Table  C.4  Traditional  Approach  Regression  Run 
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APPENDIX  D 


LA30RWH  COMPUTER  OUTPUT 

The  regression  runs  used  to  formulate  the  model  for  the  prediction 
of  LABORWH  are  included  in  this  section.  The  final  step  of  the 
stepwise  regression  run  is  presented  first,  followed  by  the  individual 
runs.  Residual  plots  are  included  to  complete  the  output.  The  draft 
model  is:  LABORWH-  INTERCEPT  +NEWDAY  +ELSTAND  +TEMP  +PRFCIP  +K.OD  +ATR 
+PLACBUGY  +TRUXPUMP  +VOLGROSS  +VOLXQD  +VOLDAY  +TVOLGROS  +PVOLGROS 
+VOLCRAN  +VOLBUGY  +VOLTRUCX  +ASSLEN  +ELASSLEN  +TASSLEN  +PAS3LEN 
+AREAVOL .  This  model  transforms  to  :  LABORWH=  INTERCEPT  ( 1+NEWDAY 
+ELSTAND  +TEMP  +PRECIP  +KOD  +ATR  +PLACBUGY  +TRUKPUMP) 
+VOLGROSS* ( 1+V0LK0D  +VOLDAY  +TVOLGROS  +PVOLGROS  +VOLCRAN  +V0L3UGY 
+VOLTRUCK)  +ASSLEN* ( 1+ELASSLEN  +TASSLEN  .+PAS3LEN )  +AREAVOL. 


Table  D.l  I^st  Step  of  Stepwise  Regression  Run 
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HO  OTHER  VARIABLES  NET  THE  0.2500  SIGHIFICAHCE  LEVEL  FOR  EHTRY 


Table  D.2  Regression  Run 
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Table  D.3  Regression  Run 
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Figure  D.6  Plot  of  Residuals  Versus  VOLTRUCX 
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Figure  D.7  Plot  of  Residuals  Versus  AS3LEN 
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